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Several newly defined peptides and epitopes are provided. Also, the finding can explain the formation of neo-epitopes in autoimmune 
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Title: Novel epitopes for celiac disease and autoimmune diseases, methods 
for detecting those and novel non-antigenic food compounds. 

The invention relates to the field of molecular biology and immunology, more 
specific to immune diseases, especially celiac disease. It further relates to neo* 
epitopes that can be generated by the enzyme tissue transglutaminase (tTG) 
which neo-epitopes can play a role in celiac disease and in autoimmune 
5 diseases. 

Introduction 

Celiac disease (CD) is a permanent intolerance to gluten (Marsh, M.N., 
Gastroent. 102 : 330-354, 1992). Gluten is a complex mixture of proteins found 

10 in wheat and several other grains, and is present in many food products. 
Typical disease symptoms include chronic diarrhoea, fatigue, and failure to 
thrive. These symptoms are associated with a lesion in the upper small 
intestine, characterised by a (sub) total villous atrophy, an increased number 
of intra epithelial lymphocytes, and a chronic inflammatory response of the 

15 lamina propria lymphocytes (Marsh, 1992). Diagnosis is based on morphology 
of a small intestinal biopsy and improvement of the clinical status after a 
gluten free diet. A strong indication for CD is the presence of antibodies 
specific for the enzyme tissue transglutaminase (tTG) in patients (Dieterich, 
W. et a/., Nat. Med, 3: 797-801, 1997). Moreover, recent studies demonstrated 

20 the involvement of this enzyme in the generation of T cell stimulatory peptides 
(Molberg. O. et aZ., Nat. Med. 4: 713-717. 1998; van de Wal, Y. et al, J. 
Immunol. 161: 1585-1588, 1998a). CD is strongly associated with HLA-DQ2 
(Al*0501, Bl*0201) and HLA-DQS (Al*0301, Bl*0302) (Spurkland A. et aZ., 
Tissue Antigens 49: 29-34, 1997). In addition, HLA-DQ2 and/or HLA-DQ8 

25 restricted, gluten specific CD4* T lymphocytes can be isolated from small 
intestinal biopsies of CD patients, and these are thought to cause disease 
(Lundin, K. et oZ.. J. Exp. Med. 178: 187-196, 1993; van de Wal, Y. et al Proc. 
Natl. Acad. Sci. USA 95: 10050-10054, 1998b). The peptide binding motif of 
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HLA-DQ2 and -DQ8 predicts a preference for negative charges at anchor 
positions in the bound peptides (Johansen, B.H. et a/., Int. Immunol. 8: 177- 
182, 1996; van de Wal. Y. et al Immunogenetics 44: 246-253, 1996; Kwok. 
W.W. et a/., J. Immunol. 156: 2171.2177. 1996). Gluten molecules, however, 

5 contain few negative charges. This discrepancy was solved by the finding that 
tTG can convert glutamine residues into glutamic acid (a process termed 
deamidation), and thus introduces negative charges in gluten peptides. Indeed 
it was found that tTG is required for, or enhances the gluten specific response 
of T cell clones and T cell lines derived from the majority of CD patients (van 

10 de Wal. 1998a; Molberg, 1998; Arentz-Hansen, H. et al, J. Exp. Med. 191: 603- 
612, 2000; Anderson, R.P. ei al, Nat. Med. 6: 337-342, 2000). Considering the 
, abundance of glutamine residues in gluten (-30-40%) tTG has many potential 
target sites in gluten. Mass spectral analysis of the deamidated T cell 
stimulatory gluten peptides, however, showed a highly restricted pattern of 

15 deamidation (van de Wal. 1998a; Molberg, 1998). Usually the pattern of 

deamidation coincides with the positions where negative charges are preferred 
in the HLA binding motif, thus favouring the binding of gluten peptides to 
HLA-DQ2 and/or --DQS. The specificity of gluten deamidation by tTG, 
therefore, is a crucial factor in the generation of toxic gluten peptides. It is not 

20 known, however, why only particular glutamine residues are targeted by tTG. 

Current therapy of CD mainly involves dietary treatment of glutensensitive 
patients with diets lacking cereal compounds such as flour, which deprives 
these patients of such typical staple foods as for instance bread. The criteria 

25 for glutenfree products are established by the Codex Alimentarius Committe 
"Nutrition and Food for Special Dietary Uses" that meets every 1.5 year. The 
current criterion is based on the determination of nitrogen. To be considered 
glutenfree a product may contain maximally 50 mg N per 100 gram of product. 
The determination is only useful when there is a certain relationship between 

30 the amount of nitrogen and the amount of gluten. This is only true, to a certain 
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extent, for wheat. There is still a need for a more reliable method to detect 
whether food materials would cause problems in CD patients. 

For auto-immune diseases in general, it is not known which combination of 
5 factors gives rise to an immunological response. One of the hypotheses is that 
presumed antigenic compounds are processed in the human body and give rise 
to so-called neo-epitopes. These neo-epitopes are then recognised by the 
immune system and disease occurs. It is, however, not yet clear in which way 
these neo-epitopes can be formed. 

10 

Summary of the invention 

The invention now provides a. peptide sequence of 14-40 amino acids, which is 
prone to deamidation by tTG and which is a causative factor of celiac disease, 

15 which comprises an amino acid sequence selected from the group consisting 
essentially of QQPYPQQPQQPFPQ, QQPFPQQPQQPFPQ, 
PFPQTQQPQQPFPQ, PFPQLQQPQQPFPQ, QFPQTQQPQQPFPQ. 
QLPFPQQPQQPFPQ, PFPQPQQPQQPFPQ and PFPQSQQPQQPFPQ. Also, 
the invention provides for epitopes made by deamidation of these peptides, 

20 thus resulting in the sequences QQPYPEQPQQPFPQ, QQPFPEQPQQPFPQ, 
PFPQTEQPQQPFPQ, PFPQLEQPQQPFPQ, QFPQTEQPQQPFPQ, 
QLPFPEQPQQPFPQ, PFPQPEQPQQPFPQ, PFPQSEQPQQPFPQ, 
QQPYPEQPEQPFPQ, QQPFPEQPEQPFPQ, PFPQTEQPEQPFPQ, 
PFPQLEQPEQPFPQ, QFPQTEQPEQPFPQ, QLPFPEQPEQPFPQ, 

25 PFPQPEQPEQPFPQ, PFPQSEQPEQPFPQ, QQPYPQQPEQPFPQ, 
QQPFPQQPEQPFPQ, PFPQTQQPEQPFPQ, PFPQLQQPEQPFPQ. 
QFPQTQQPEQPFPQ. QLPFPQQPEQPFPQ, PFPQPQQPEQPFPQ and 
PFPQSQQPEQPFPQ. wherein further Q residues may be deamidated into an 
E residue. 
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The invention also provides for peptide sequences, which are prone to 
deamidation by tTG and which are a causative factor of an autoimmune 
disease selected from the group consisting essentially of RA (rheumatoid 
arthritis). MS (multiple sclerosis), SLE (systemic lupus erythematosus), SS 

5 (Sjogren syndrome) and DB (diabetes), which peptides are shown in table 5. 
Preferably these peptide sequences are residing in sequences 8-40 amino acid 
residues which comprise at least one 8-mer which is a subsequence of at least 
one of the sequences listed in claim 3, wherein said 8-mer comprises the 
central Q residue of the sequence listed in table 5. The invention also harbours 

10 the epitope formed by deamidation of one or more of the Q residues of these 
peptides. 

A further part of the invention is an isolated HLA-DQ restricted T-cell capable 
of recognising a gluten-derived epitope according to the invention and more 
specifically an isolated or recombinant HLA-DQ restricted T-cell receptor 
15 capable of recognising such an epitope. 

Also encompassed are antibodies reactive with a gluten-derived peptide or 
epitope according to the invention. 

Further disclosed in the invention is a method to screen foodstuffs or plant 
material for the occurrence of these peptides or epitopes using such a T cell, a 
20 T cell receptor or antibody. Preferably in such a method the foodstuff or plant 
material is pretreated with a protease such as trypsin, chymotrypsin or pepsin. 
Most preferably in such a method the foodstuff or plant material is 
additionally treated with tTG. 

Furthermore part of the invention is a diagnostic kit comprising: a T cell, a T 
25 cell receptor or an antibody as mentioned above and a suitable means of 

detection especially for detecting in food, food components, plant material or 
biological samples the presence of a gluten-derived peptide or epitope. 
Further part of the invention is an oligonucleotide primer set suitable for use in 
an amphfication assay which is capable of binding to a nucleotide sequence 
30 coding for a gluten derived peptide or epitope or a DNA sequence which codes 
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for a peptide which matches a consensus sequence according to the following 
formula (1). (2). (3), (4) or (5): 

(1) Xi . X2 - Xa - Q4 - - Pg - Q? - Xa - P9 - Xio 

in which Xio is F, Y, M, W, I or L and Xi, X2, X3, X5, and Xg are any amino 
5 acid; 

(2) Xi - X2 - X3 - Q4 - Xs - Pe - Q7 - Xa - Xo - Xio 

in which X9 is F, Y, M, W, I or L and Xi, X2. X3, Xs, Xs and Xio are any amino 
acid; 

(3) Q-Xi-P in which Xi is any amino acid; 

10 (4) Q - Xi - P - X2 in which Xi is any amino acid and X2 is FXW,M,L,I. 
orV; 

(5) Q - Xi - X2 - X3 in which Xi and X2 are any amino acid and X3 is 
F,Y.W,M.L,I,orV. 

Said primer set is preferentially used in a nucleotide amplification assay 
15 wherein food, food components, plant material or biological samples are tested 
for the presence of proteins or peptides which match any of the consensus 
sequences. 

The hereinbefore mentioned detection methods may be used for the detection 
of cereals which are better suited to be used in food for celiac disease patients. 

20 It is also part of the invention to produce cereals which are better suited to be 
used in food for celiac disease patients by selecting cereals having a low 
amount of proteins or peptides matching the consensus sequences and 
crossbreeding them to produce cereals with even lower amounts. 
Alternatively, the cereals can be produced by using genetic engineering 

25 techniques to diminish the natural amount of proteins or peptides matching 
the consensus sequences. Preferably the genetic engineering comprises 
transformation with recombinant DNA, most preferably homologous 
recombination or gene silencing, or induced mutation. 
The preferred cereal is wheat. 
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Also part of the invention are cereal plants thus obtained and the use of such a 
cereal to prepare food for celiac disease patients. 

Another part of the invention is a method to identify neo-epitopes from auto- 

5 antigens comprising comparing the sequence of an auto-antigen and searching 
for peptide sequences which match with the consensus sequences as defined 
before and subsequently screen biological material obtained from auto-immune 
disease patients for antibodies which bind to the peptide sequences or T cells 
which bind to the HLA-bound peptide sequences as identified in the previous 

10 step or peptides which are formed from said peptide sequences by deamidation 
of one or more Q residues- 
Further, the invention comprises a method to inhibit the binding of a T cell 
receptor to an HLA-bound gluten-derived epitope or to an HLA-bound auto- 

15 antigen derived epitope comprising providing a blocking substance, preferably 
by blocking the binding of said epitopes to the HLA molecules. 
Also comprised in the invention are analogues of the epitopes of the invention 
which are an antagonist for the activity of T cells recognising said epitope. 
A further part of the invention is a pharmaceutical composition comprising a 

20 peptide or an epitope according to the invention, which may preferably be used 
for the induction of tolerance or for the treatment of celiac disease or auto- 
immune disease. 

A next part of the invention is a method to decrease sensitivity to auto- 
antigens by inhibiting the deamidation of said auto-antigen, preferably by 
25 inhibiting the function of tTG and most preferably at the site of presence of the 
auto-antigen. 

Also part of the invention is the use of a protease inhibitor for preventing the 
generation of a peptide according to the invention or a polypeptide comprising 
such a peptide. For the similar purpose acid neutraUsing substances can be 
30 used. Further, the invention provides for a method to reduce the production of 
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a auto-antigen derived peptide by inhibiting the expression of the auto-antigen 
from which it is derived. 

Legend to the figures 
5 Figure 1. The influence of amino acid substitutions on deamidation of Qaos and 

Q216. 

A. Simultaneous mass spectral analysis of four substitution analogs of the 
gliadin peptide before and after treatment with tTG as indicated. The 

10 untreated substitution analogs with a V, D, E, or F at position Qaos+S have a 
mass of 1313, 1329, 1343 and 1361 Da respectively. After treatment with tTG 
two of those masses increase 1 Da, 1313 to 1314 and 1361 to 1362, indicating 
the conversion of a glutamine residue (128 Da) to a glutamic acid (129 Da) in 
the peptides with a V and F at position Q208+3. 

15 

B. Overview of the influence of C-terminal amino acid substitutions on 
deamidation of Q208 in the short version of the gliadin peptide and Q216 in the 
long version of the peptide. The amino acids G209, S210, F2n, N217, and Q219 were 
substituted for all amino acids except lysine and cysteine and the effect on 

20 deamidation was determined. Substitution at other position in the peptide had 
only minor effects on deamidation of the Q208- and Q2i6-residues (not shown). (- 
) indicates no effect of the substitution; Substitutions that resulted in major 
differences in deamidation are designated with arrows: decrease of 70- 
100%, i decrease of 30-69%. 

25 ^ Substitution analogs were made in which the amino acids indicated were 

replaced by every natural amino acid except cysteine and lysine, because these 
residues could affect deamidation by formation of disulphide bridges and tTG 
driven cross-linking, respectively. 

30 Figure 2. Stimulation of gluten specific T cell lines by predicted peptides 
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A. A gluten specific T cell line isolated from a small intestinal biopsy of a celiac 
disease patient was tested against gluten, tTG-treated gluten and against the 
predicted peptides that were present in pools of five peptides each (pools 1-14). 
5 The odd numbers represent pools that contain the native peptides, whereas 
the even numbers represent corresponding pools that were treated with tTG. 
Strong reactivity of the T cell line was only observed with tTG-treated gluten 
and tTG-treated pools of peptides predicted by the XXXQXPQXPY algorithm. 

10 B. Reactivity of the T cell Une against the individual peptides firom the T cell 
stimulatory pools. Bars indicated with an asterisk are considered positive 
(Stimulation Index > 3). 

Figure 3 

15 

tTG inhibitor 
Figure 4 

20 Inhibition of tTG induced deamidation of a gluten peptide by the inhibitor 
depicted in Figure 3. 

Detailed description 

25 In celiac disease both T-cells reactive with (epitopes derived from) gluten and 
antibodies reactive with the enzyme tTG (tissue transglutaminase) have been 
found. It is hypothesized that the epitopes from gluten are the causative factor 
of the disease. The antibodies specific for the endogenous tTG can be explained 
because the enzyme crosslinks with gluten proteins or parts of those, which 

30 complex then will be recognized by a B-cell expressing tTG specific antibodies 
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on its cell surface. The complex, including the gluten will then be processed by 
the B-cell to present the antigenic determinants to gluten specific T-cells, 
thereby delivering specific T-cell help, resulting in the secretion of tTG- 
specific antibodies. 

5 

We have now determined the patterns of deamidation in gluten, and find that 
this is highly dependent on the spacing between the glutamine and proline 
residues. This knowledge can be used to predict novel T cell stimulatory gluten 
peptides. Moreover, our results provide an explanation for the toxicity of 
10 gluten and the related hordeins in barley and the secalins in rye, while other 
food proteins, in particular the avenins in oats, are not toxic for CD patients. 
Also, the finding can explain the formation of neo-epitopes in autoimmune 
diseases such as RA (rheumatoid arthritis), MS (multiple sclerosis), SLE 
(systemic lupus erythematosus), SS (Sjogren syndrome) and DB (diabetes). 

15 

It has been known in the prior art that the tTG enzyme affects the 
deamidation of a glutamine (Gin; lUPAC code: Q) moiety on the target peptide 
into a glutamic acid moiety (Glu; TUPAC code: E). Gluten proteins, such as 
gliadin. glutelin, hordein and secalin, however, contain very many Q residues 

20 (in gliadin approximately 30-40% of all amino acid residues) and it has been 
shown that not all Q residues are susceptible to deamidation by tTG in the 
same manner. Thus, it is clear that the near environment of the Q residue on 
the target protein (be it in the primary, secondary or tertiary structure of the 
protein) influences the availability of the Q residue to act as a substrate for the 

25 enzyme. It has been shown earlier (van de Wal, 1998a) that in one of the 
gliadin peptide parts which is able to stimulate T-cells 
(PSGQ208GSFQPSQQ216NPQA) the Q residues at position 208 and 216 are 
deamidated, while the Q residues at positions 212, 215 and 219 are less or not 
amidated. From detailed studies as are shown in the experimental part it has 

30 now been established that the presence of proline (Pro, lUPAC code P) in the 
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neighbourhood of the Q residue is a key determining factor of the susceptibility 
of said residue to deamidation by tTG, In the sequences QP and QXXP (X is 
any amino acid) no or very Uttle deamidation of the Q-residue was observed. In 
contrast, in the sequences QXP, QXXF(Y, W, L, I, M or V) and QXPF(Y,W, L, I, 
5 M, or V) efficient deamidation of the Q-residue was observed. Also, a minimum 
of three amino acids C-terminal of the target Q-residue is required for 
deamidation. Based on this finding two consensus sequences have been 
developed which have a very high predictive value for susceptibility to tTG 
catalysed deamidation. 

10 On the basis of these results two search algorithms were defined, specifically 
aimed to aid in the search for HLA-DQ2-binding peptides in the gluten 
databases that would be a potential target for gluten specific T cells isolated 
from ceUac disease patients. In these algorithms the deamidation of Q redidues 
at position p7 and/ p4 is predicted to mediate binding to HLA-DQ2. The first 

15 algorithm predicts deamidation at p4 and p7 and reads as follows: 

(1) Xi -X2 • X3 . Q4 - Xs - Pe- Q? • Xs - Pd • Xio 

in which Xw is F, Y, M, W, I or L and Xi, X2. X3, X5, and Xs are any amino 
20 acid. 

The second algorithm predicts deamidation at p4 and reads as follows: 

(2) Xi - X2 • Xs - Q4 • Xfi • Pe . Q7 - Xs - Xo - Xio 

25 

in which X9 is F, Y, M, W, I or L and Xi, X2, X3, X5, Xa and Xio are any amino 
acid. 

On basis of these algorithms (also called herein consensus sequences) novel 
peptide parts of gluten proteins have been discovered, which are able to act as 
30 substrate for tTG and which, upon deamidation, have antigenic activity as 
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measured by their ability to stimulate gluten specific T cell lines. From hereon 
these novel peptides will be called gluten-derived peptides. These peptides 
have the amino acid sequences: QQPYPQQPQQPFPQ, QQPFPQQPQQPFPQ, 
PFPQTQQPQQPFPQ, PFPQLQQPQQPFPQ, QFPQTQQPQQPFPQ, 
5 QLPFPQQPQQPFPQ, PFPQPQQPQQPFPQ and PFPQSQQPQQPFPQ. 

It should be clear that also C-terminally and N-terminally extended peptides 
comprising the herebefore mentioned sequences are part of the invention. 
Thus, the invention comprises amino acid sequences of 14-40 amino acids 
which comprise the above mentioned sequences. 

10 Also part of the invention are the epitopes derived from these gluten-derived 
peptides by deamidation of one or more Q residues. These gluten-derived 
epitopes thus have the amino acid sequences: QQPYPEQPQQPFPQ, 
QQPFPEQPQQPFPQ, PFPQTEQPQQPFPQ, PFPQLEQPQQPFPQ, 
QFPQTEQPQQPFPQ, QLPFPEQPQQPFPQ, PFPQPEQPQQPFPQ, 

15 PFPQSEQPQQPFPQ, QQPYPEQPEQPFPQ, QQPFPEQPEQPFPQ, 
PFPQTEQPEQPFPQ, PFPQLEQPEQPFPQ, QFPQTEQPEQPFPQ, 
QLPFPEQPEQPFPQ, PFPQPEQPEQPFPQ, PPPQSEQPEQPFPQ, 
QQPYPQQPEQPFPQ, QQPFPQQPEQPFPQ, PFPQTQQPEQPFPQ, 
PFPQLQQPEQPFPQ, QFPQTQQPEQPFPQ. QLPFPQQPEQPPPQ, 

20 PFPQPQQPEQPFPQ and PFPQSQQPEQPFPQ. wherein further Q residues 
may be deaminated into an E residue. 

Furthermore, it is known from the prior art that neo-epitopes derived from 
presumed antigenic auto-immunogens can be a mechanism to explain the 

25 occurence of major auto-immune diseases such as RA (rheumatoid arthritis), 
MS (multiple sclerosis), SLE (systemic lupus erjrthomatosus), SS (Sjogren 
syndrome) and DB (diabetes). Thus far, it has not been recognised which factor 
is responsible for the existence of these neo-epitopes» Surprisingly now, when 
the amino acid sequences of the antigens which are deemed to play a role in 

30 the above mentioned auto-immune diseases are screened with the following 
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consensus sequence(s): QXiP, QX1PX2 or QX1X1X2 (wherein Xi is any amino 
acid and X2 is F. Y, W, M, L, I or V) or the sequences as described above in (1) 
or (2) in all antigens listed in Table 5 peptide sequences are found which are 
prone to deamidation by tTG. It is now postulated that in these diseases tTG is 
5 able to recognise the antigens or processed antigens and that the peptide part 
which is recognised has been deamidated and will function as a neo-epitope 
which will evoke a T-cell response. 

Table 5 shows the antigens and the amino acid sequences comprised in these 
antigens which are thought to be involved in the above mentioned auto- 
10 immune diseases. Thus, a further aspect of the invention comprises the auto- 
antigen derived peptides listed in Table 5, including the peptides in which the 
Q residue is replaced by an E at position 10 (the central Q residue). 

To a person skilled in the art it is known that peptides of variable length can 
15 bind to HLA-molecules. The relevant peptides can therefore be longer or 

shorter versions as those shown and might even extend to N-terminally and/or 
C-terminally extended sequences that are not shown but present in the 
proteins from which these peptide sequences are derived. Thus, the invention 
comprises amino acid sequences of 8 amino acid residues (8-mers) derivable 
20 from and comprising the central Q residue of those sequences as listed in Table 
5. For example then (taking the first sequence of Table 5: 
SRFSWGAEGQRPGFGYGGR) the peptides of the invention would include the 
foUowing 8-mers: FSWGAEGQ, SWGAEGQR, WGAEGQRP, GAEGQRPG, 
AEGQRPGF, EGQRPGFG, GQRPGFGY and QRPGFGYG. Further comprised 
25 in the invention are all peptides of 40 amino acid residues or less than 40, 
which comprise one or more of the above mentioned 8-mers. For ease of 
reference these peptides further are denominated as auto-antigen derived 
peptides. 
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Analogous to the situation with the gluten-derived peptides, also here in case 
of the auto-antigen derived peptides, neo-epitopes can be defined in which the 
central Q residue in the peptides listed in Table 5 has been deamidated into an 
E residue. 

5 

Since these consensus sequences now make it possible for the first time to 
predict amino acid sequences which upon deamidation by tTG act as epitopes 
in ceUac disease or other auto-immune diseases, the information can also be 
used in various assays and therapies relating to these diseases, 

10 

The most promising application in the area of celiac disease is to screen 
foodstuffs for the presence of sequences matching with the consensus 
sequence(s). 

15 A first embodiment for such a screening approach which is included in the 
invention is the use of (HLA-DQ2 and HLA-DQ8) restricted T cells which 
recognise the gluten-derived epitopes, or the receptor of such T cells. Such T 
cells are obtainable by methods well known to those skilled in the art. A 
preferred way to obtain these T cells is to collect T cell biopsies fi:om celiac 

20 disease patients, which show typical clinical symptoms and/or are responsive 
in an anti-endomysium test. These biopsies then are cultured with either a 
(chymo)trypsin/pepsin digest of gluten optionally additionally treated with 
tTG. Cultures that show evidence of T cell proliferation are expanded and 
tested for specificity to the gluten-derived epitopes of the invention. From 

25 these gluten specific T cells optionally clones can be generated and the T cell 
receptor of such T cell clone can be cloned and transfected or transduced into 
other appropriate cells or cell Unes to yield a functional T cell receptor. 
These T cells or T cell receptors can be used to screen foodstuffs for the 
presence of epitopes. They can also be used to screen plant material, especially 

30 varieties of cereals such as wheat, to check the occurrence of said epitopes. 
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Preferably in these types of assays the protein material in the food or in the 
plant varieties is treated with a protease such as tr5T)sin, chymotrypsin or 
pepsin and further treated with the enzyme tTG to mimic the protein 
degradation in the human body. 
5 Preferably such a screening is performed with a panel of T cells having specific 
reactivity to different epitopes. In this way the safety of food for celiac disease 
patients can be tested reliably. Also in this way it is possible to detect varieties 
of cereals which would be suitable for incorporation into food products for 
celiac disease patients. 

10 

Next to the use of specific T ceUs another embodiment comprised in the 
invention for assaying the safety of foodstuffs or the suitability of cereal 
varieties for food processing is to use antibodies which are capable to bind the 
gluten-derived peptides or gluten-derived epitopes of the invention. Such 

15 antibodies are for instance obtainable by immunising an immunocompetent 
animal with the gluten-derived peptides or gluten-derived epitopes according 
to the invention or an immunogenic firagment thereof, coupled to a suitable 
carrier protein if necessary, and harvesting polyclonal antibodies from said 
animal, or obtainable by other methods known in the art such as by producing 

20 monoclonal antibodies, or (single chain) antibodies or binding proteins 

expressed from recombinant nucleic acids derived from a nucleic acid library, 
for example obtainable though phage display techniques. 
With such an antibody, the invention also provides for an immunoassay 
comprising an antibody according to the invention. Various types of 

25 immunoassays are available within the art, for example BLISA (Enzyme 
Linked Immuno Sorbent Assay) or Western blotting. 

Analogous to assays with specific T cells as discussed above, also in the case of 
antibodies preferably a panel of antibodies is used. 
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A further example for assaying the safety of food and to identify cereal 
varieties that lack the predicted T-cell stimulatory sequences is the use of 
nucleotide primer based amplification technology. In such a method 
oligonucleotide primers are developed corresponding with DNA coding for the 
5 specific gluten-derived peptides or the general amino acid consensus 

sequence(s). These primers then are brought into contact with DNA or RNA 
isolated from the food or the plants and the mixture is subjected to a treatment 
to amplify the nucleotide sequences to which the primers bind. These DNA or 
RNA amplification techniques, such as PGR and NASBA, are known and 

10 readily available to a person skilled in the art. The amplification step will 
result in a specific multiplication of the nucleotide firagment to which the 
primers bind and, in the present case, thus is indicative of the occurrence of 
even a single copy of the nucleotide coding for a gluten-derived peptide or a 
peptide which matches the consensus sequence. 

15 It is acknowledged that to develop an assay testing for the safety of food for 
celiac disease patients more than one T cell clone, more than one set of primers 
or more than one antibody is needed. In yet another embodiment the invention 
thus comprises assays having an array of antibodies as discussed above, aU 
recognising different sequences, which fall within the consensus sequence(s) of 

20 the invention. Similarly, the invention also provides for a multiple of 

oligonucleotide primer sets which are able to bind with different nucleic acids 
coding for amino acid sequences which fall within the consensus sequence(s) of 
the invention. 

25 In yet another embodiment the invention provides a diagnostic kit 

comprising an antibody according to the invention or a set of oligonucleotide 
primers according to the invention and a suitable means of detection. Such a 
diagnostic kit is, for example, very useful for detecting in food, food 
components or samples from (suspected) patients the presence of a gluten- 

30 derived peptide according to the invention or a peptide sequence involved in 
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food-related immune enteropathy (for example: celiac sprue, tropical sprue, 
giardiasis or food allergies of childhood) matching the consensus sequence(s) of 
the invention. At present such a quantitative and quaUtative diagnostic kit 
determining the presence and/or amount of gluten-derived peptide is not 
5 available. Currently two different assays are used for gluten detection. One 
assay determines the nitrogen content of food (components) as a measure for 
the presence of gluten. The other assay utilises gluten specific antibodies in 
ELISA systems. However, both assay systems do not test for the toxic gluten- 
derived peptides involved in food-related immune enteropathy. A diagnostic kit 

10 comprises, for example, an antibody according to the invention specifically 
recognising a toxic gluten-derived peptide involved in food-related immune 
enteropathy. Another advantage of the diagnostic kit as described in the 
present application is the capability of testing food (components) which cannot 
be tested or cannot be tested reliably by the currently used gluten assays. The 

15 existing assays are hardly informative when food (components) contain 
significant amounts of other nitrogen containing compounds (e.g. other 
proteins) or when food (components) contain partially hydrolysed gluten 
proteins that are not recognised by antibodies cmrrently used in ELISA-kits. 
Examples of food (components) for which the existing assays are troublesome 

20 are beer, melassis and soy sauce. In addition, the existing assays lack the level 
of sensitivity required for many applications. 

Preferably a diagnostic kit according to the invention uses different 
kinds of T cells, T cell receptors, antibodies or oligonucleotide primer sets 
according to the invention or different antibodies, each capable of recognizing 

25 another gluten-derived peptide involved in food-related immune enteropathy. 
Thereby multiple gluten-derived peptides involved in food-related immune 
enteropathy are detected. In the art different kinds of means of detection are 
available and the skilled person knows how to select a proper means of 
detection. Examples are chromogenic or fluorigenic substances. The invention 
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thus provides methods and means for the monitoring of a T-cell reactive 
component in food, food component or samples from (suspected) patients. 

Another aspect of the invention is the use of the consensus sequence(s) 
of the invention to search for peptide sequences which could invoke immune 
5 responses. One of the possible ways to achieve this is to screen existing protein 
or nucleotide databases and to identify amino acid sequences or nucleotide 
sequences coding for amino acid sequences which match with the consensus 
sequence of the invention. Methods for identification of matching sequences in 
databases, such as the BLASTN computer program (which is publicly 
10 available, for instance through the National (Tentre for Biotechnological 

Information, accessible via the internet on http://www.ncbi.nlm.nih.gov/) are 
well known to persons skilled in the art. 

Furthermore the invention provides a method to select and/or breed a cereal 
which has a decreased amount of or is substantially free of peptide sequences 

15 which are toxic for celiac disease patients. Cereals are selected for the absence 
of peptide sequences matching the consensus sequence(s) of the invention 
according to any of the above described methods. Such selected cereals are 
than included in breeding schemes using normal agricultural breeding 
methods to develop cereal varieties useful for food production which exhibit a 

20 better tolerance to celiac disease patients. Cereal, in this application, relates to 
grain or related grasses or plants that produce it and to the (prepared) 
foodstuff. In particular wheat gluten, but also rye, and to a lesser extent barley 
and oat are preferred cereals. 

Because the consensus sequence(s) for determining peptides which cause 
25 immunological effects are disclosed herein, one is now also able to genetically 
modify the genome of cereals to generate new cereals with a decreased source 
of toxic peptides. Modifications are, for example, generated by point-mutations 
in the nucleic acid sequence coding for the gluten proteins or are generated by 
replacing, e.g. through homologous recombination, a sequence coding for an 
30 amino acid sequence which matches the consensus 8equence(s) of the invention 
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by another sequence not giving rise to amino acid sequences matching the 
consensus sequence(s). Further, expression of a protein which contains toxic 
peptide sequences may be prohibited by gene silencing. Numerous methods for 
gene silencing in plants, such as antisense expression, sense co-suppression 
5 and RNAi are known in the art and readily applicable by a person skilled in 
the art. 

Since, according to this invention, the presence of proline residues determine 
for a large part the susceptibility of a peptide for deamidation by tTG, it is 
preferred to change the nucleotide sequences coding for the amino acid 

10 sequences in such a way that the proline residues of peptide fragments which 
match the consensus sequence are changed into other residues. Such 
conservative substitutions of the proline residues are possible without 
substantially affecting the chemical and physical properties that are so desired 
in gluten. Cereal plants, thus genetically modified, can again be used in cross- 

15 breeding programmes in order to come to a variety which can be used in 
agriculture. 

A cereal selected and/or bred according to a method of the invention is used to 
prepare food low or preferably free of toxic peptides involved in food-related 
immune enteropathy. 

20 Further, the invention also comprises methods to identify possible toxic 

peptides from auto-antigens. According to any of the above-mentioned methods 
peptide sequences from antigens which are deemed to play a role in auto- 
immune diseases can be detected. Table 5 lists a number of predicted toxic 
peptide sequences comprised in proteins which are believed to play a role in 

25 diseases such as diabetes, rheumatoid arthritis, multiple sclerosis, systemic 
lupus erythematosus, Sjogren syndrome and the like. It is now possible, using 
the information from the consensus sequence(s) of the invention to identify 
toxic peptides from yet other auto-antigens. Neo-epitopes can be formed from 
these peptide sequences by deamidation with tTG. It is now possible for a 

30 person skilled in the art to investigate whether these neo-epitopes indeed play 
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a role in auto-immune diseases, A simple way to do this is to screen serum, 
lymph or any other biological material from auto-immune disease patients for 
the presence of T cells and/or antibodies against such neo-epitopes (which do 
not, or to a lesser extent, occur in non-autoimmune diseased control patients). 
5 This would further allow for a diagnostic method for assessing disease or 
predisposition to disease on basis of the occurrence of said antibodies. 
Furthermore the invention provides a method to decrease or more 
• preferably completely inhibit the binding of a T-cell receptor to a gluten- 
derived epitope or neo-epitope involved in auto-immune disease comprising 

10 providing a blocking substance of said T-ceU receptor. By decreasing and more 
preferably completely inhibiting the binding of the T-cell receptor to an HLA- 
bound gluten-derived epitope or neo-epitope, effects of the immune disease are 
decreased or preferably completely diminished. Such a blocking substance 
prevents the association of the T-cell receptor with the HLA-bound gluten 

15 peptide and prevents the T-cell receptor in its activity. For example such a 

blocking substance is a natural or synthetic variant of a gluten-derived epitope 
or neo-epitope according to the invention, for example an altered peptide 
ligand, and is especially well suited for use in a therapy against gluten-derived 
peptide or auto-antigen derived peptide sensitivity. It is clear that the binding 

20 of said blocking substance to a T-cell receptor does not allow functional 
signalling of a T-cell comprisrag said T-cell receptor. 

Furthermore gluten-derived peptides or auto-antigen derived peptides 
according to the invention are used to prepare therapeutic agents capable of 
eliminating a subset of cells, directly or indirectly, especially gluten-sensitive 

25 or auto-antigen sensitive T-cells. This means that an agent, which typically 
comprises a gluten-derived peptide or an auto-antigen derived peptide 
according to the invention as recognised selectively by T-cells, which agent 
induces ehmination of the cells recognising said peptide, is administered to the 
patient. Such an agent most typically also comprises a toxic moiety to mediate 

30 the elimination of the specific T cells. 
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In yet another embodiment the invention provides a method to decrease 
(or more preferably completely inhibit) the binding of gluten-derived peptides 
involved in food-related immune enteropathy (for example celiac sprue, 
tropical sprue, giardiasis or food allergies of childhood) or auto-antigen derived 
5 peptides involved in auto-immune diseases to T cell receptor molecules 

comprising providing substances that block the binding of said peptides to said 
receptor molecules. By decreasing and more preferably completely inhibiting 
the binding of a gluten-derived epitope to an HLA-DQ molecule or the binding 
of a neo-epitope to its receptor at the T cell, effects of the immune related 

10 disease are decreased or more preferably, completely diminished. 

A blocking substance associates with, for example, a T cell receptor molecule 
and prevents the epitope to associate with said molecule. Such a blocking 
substance is, for example, a natured or synthetic variant of a gluten-derived 
epitope or neo-epitopQ. It is clear that the binding of said blocking substance to 

15 said T cell receptor is such that it decreases or more preferably completely 
diminishes the binding of a gluten-derived epitope or neo-epitope to said 
receptor molecule. Another way to decrease or more preferably to completely 
inhibit the binding of a gluten-derived epitope to an HLA-DQ molecule or a 
neo-epitope to its T cell receptor, is by providing an antibody which associates 

20 with said gluten-derived epitope or neo-epitope. 

In yet another embodiment the invention provides a method to detect 
and/or enumerate T-cells bearing a T-cell receptor according to the invention 
comprising tetrameric complexes of HLA molecules and a gluten-derived 
epitope or neo-epitope according to the invention. Methods to arrive at such a 

25 tetrameric complex are known in the art. In another embodiment the invention 
provides a method to detect and/or enumerate T-cells bearing a T-cell receptor 
according to the invention comprising (synthetic) hposomes comprising 
complexes of HLA molecules and a gluten-derived epitope or neo-epitope 
according to the invention. Methods to arrive at such complexes are known by 
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the person skilled in the art. These data provide a novel tool for detection and 
enumeration of T-cells comprising a T-cell receptor according to the invention. 

In another embodiment the invention provides a pharmaceutical 
composition comprising a gluten-derived peptide or auto-antigen derived 
peptide according to the present invention. Such a pharmaceutical composition 
is used for the induction of tolerance against said gluten-derived or auto- 
antigen derived peptide. For tolerance induction doses of this peptide 
according to the invention are given repeatedly, for instance intravenously, but 
other routes of administration are suitable too. Another possibility is the 
repeated oral or nasal administration of such a peptide. Such a peptide 
according to the present invention is given alone, or in combination with other 
(toxic) gluten-derived peptides and/or auto-antigen derived peptides, or as part 
of larger molecules, or coupled to carrier materials/molecules. A 
pharmaceutical composition comprising a peptide according to the present 
invention can also be used for elimination of a certain subset of T-cells or for 
the treatment of celiac disease or auto-immune diseases. Preferably such a 
pharmaceutical composition according to the present invention contains 
various, different kinds of, gluten-derived or auto-antigen derived peptides. 

In yet another embodiment use is made of a protease inhibitor or of acid 
neutralizing substances for preventing the generation of a gluten- derived or 
auto-antigen derived peptide according to the invention or a polypeptide 
comprising such a peptide. The proteins which comprise the gluten-derived or 
the auto-antigen derived peptides are not capable of binding to an HLA 
molecule directly and must first be processed by proteases to provide a peptide 
or peptides capable of binding to an HLA molecule. Gluten-derived peptides 
and polypeptides comprising a gluten-derived peptide according to the 
invention are bound to HLA-DQ molecules and are thereby recognized by a T- 
cell receptor. By preventing the formation of gluten-derived peptides, binding 
to HLA-DQ molecules and recognition by T-cell receptors is prevented. One 
way to prevent a gluten-derived peptide from being generated is by inhibiting 
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the enzyme (for example by protease inhibitors) which is capable of processing 
the proteins from which the gluten-derived peptides are derived (for example 
glutenins and/or gliadins). Another way to prevent the gluten-derived peptides 
from being generated is inactivating the enzyme, which is capable of 
5 processing the proteins from which the gluten-derived peptides are derived by 
providing neutralizing substances. Pepsin and trypsin are examples of 
enzymes that work under acidic conditions and by providing neutralizing 
substances the effects of these enzymes are diminished or more preferably 
completely inhibited. 

10 In the case of auto-antigen derived peptide or polypeptides comprising those, 
inhibition of the reaction which deamidates said peptides to form neo-epitopes 
would yield a decrease of the antigenic effect of the auto-antigen. Preferably 
said inhibition is accomplished by local inhibition of the enzyme tTG at the 
8ite(s) where normally these neo-epitopes are formed. For celiac disease, site 

15 specific administration in the gut would be the method of choice. In RA, the 
specific site to inhibit the formation of neo-epitopes would be the joints, while 
for Sjogren syndrome the site would be the tear and salivary glands. Inhibition 
of the enzyme can be accomplished by specific inhibitory compoimds, e.g. by 
antibodies binding to the enzyme in such a way that the enzymatic function is 

20 decreased. Another approach is to design specific compounds on the basis of 
the defined specificity of tTG for particular gluten sequences like QLPF and 
QXXF. Site-specific action of such compounds can be achieved by local 
application of the inhibitor, e.g. by injection. An example of a tTG 
inhibitor/blocker is disclosed herein within the experimental part. A lead 

25 compound was designed on the basis of an octapeptide, sequence QPQLPYPQ, 
known to be an tTG substrate {Arentz-Hansen, H., R. Korner, O. Molberg, H. 
Quarsten, W. Vader, Y. M. Kooy, K E. Lundin, F. Koning, P. Roepstorff, L. M. 
SoUid, and S, N. McAdam. 2000. The intestinal T cell response to alpha-gliadin 
in adult celiac disease is focused on a single deamidated glutamine targeted by 

30 tissue transglutaminase. J.EsqpMed 191:603-612} and containing the 
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sequence QLPY which allows the transformation of the indicated glutamine 
(Q) into glutamate. On basis of the structure as disclosed in Fig. 3, a person 
skilled in the art is capable of introducing modifications which leave the 
inhibitor function intact but which would make the compound more suitable 
5 for therapeutic applications. For example, an optimum between 

pharmacological efficacy and ease of synthesis will be established by trimming 
the oligopeptide sequence since previous results indicate that the minimal 
peptide fragment required for recognition by tTG is a peptide firom 3 to 4 
amino acids in length {as disclosed herein and in Vader, L. W., A. De Ru, W. Y. 

10 van der, Y. M. Kooy, W. Benckhuijsen, M. L. Mearin, J. W. Drijfliout, P. Van 
Veelen, and F. Koning, 2002. Specificity of tissue transglutaminase explains 
cereal toxicity in celiac disease. J.ExpMed. 195:643-649}. Furthermore, 
recognition of the inhibitor by tTG may be enhanced by modifying the amino 
acid composition and sequence. First of all, amino acid sequences having the 

15 sequence QXP, QXXF or QXPF, as discussed earlier, are especially useful, - 
since these sequences are shown to be target sequences for the enzyme. 
Further, the binding strength of the inhibitor can be modulated by 
incorporating different electrophilic traps, either reversible (aldehyde, 
boronate) or irreversible (chloromethyl ketone, vinylsulfone, epoxyketone). 

20 Protease resistance is another important point which can be addressed by 
replacement of several selected peptide bonds with isosteres {e.g. N- 
alkylamide, sulfonamide, urea or alkene moieties). In this respect, it is of 
interest to study the effect of incorporation of sugar amino acids in the peptide 
sequence. 

25 

In another embodiment, the invention provides a tTG inhibitor/blocker 
characterised in that said inhibitor/blocker is derived from a peptide sequence 
as disclosed herein and further comprises an electrophilic trap. In yet another 
embodiment, the invention provides a tTG inhibitor/blocker characterised in 
30 that said inhibitor/blocker is derived from the peptide QPQLPYPQ and further 



wo 03/066079 



PCT/NLO3/O0O77 



24 

comprises an electrophilic trap. Preferably, the electrophilic trap replaces the 
carboxamide moiety of a glutamine that is converted into a glutamate by the 
action of tTG. As disclosed herein within the experimental part, replacement of 
the carboxamide moiety of the glutamine at position 3 in the peptide 
5 QPQLPYPQ by a vinylogous ester as an electrophilic trap leads to effective 
inhibition of tTG. It is clear to a person skilled in the art that other 
electrophilic traps can also be used. An example of a tTG inhibitor/blocker is 
depicted in Figure 3. 

10 The invention also provides a pharmaceutical composition comprising a tTG 
inhibitor/blocker as described above or a functional equivalent and/or 
functional fragment thereof. Furthermore, the invention also provides use of a 
tTG inhibitor/blocker as described above or a functional equivalent and/or 
functional fragment thereof for the preparation of a medicament for the 

15 treatment of celiac disease or auto-immune disease. 

The invention will be explained in more detail in the following detailed 
description which is not limiting the invention. 



1 
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Materials and Methods 



Peptides and ITG treatmenL Peptides were synthesised by standard Fmoc 
chemistry on a multiple peptide synthesizer (SyroII. MultiSynTech GmbH, 
5 Witten, Germany). Integrity of synthetic peptides was checked by rpHPLC and 
mass spectrometry. Tissue transglutaminase (tTG) treatment was performed 
by incubating the peptides (500 ^ig/ml) with the enzyme (100 jig/ml, Sigma, 
Zwijndrecht, NL; T-5398) at 37oC for 4 h minimum, in 50 mM triethylamine- 
acetate pH 6.5, 2 mM CaCh- 

10 

Mass spectrometry. Electrospray ionisation mass spectrometry was performed 
on the synthetic gluten peptides before and after tTG treatment, using a Q- 
TOF hybrid mass spectrometer (Micromass. Manchester, UK). Precursor ions 
were selected with the quadrupole window set to 3 Da and fragments were 

15 collected with high efficiency with the orthogonal time of flight mass 

spectrometer. The collision gas applied was argon (pressure 4x10-5 mbar) and 
the collision voltage approximately 30 V. Deamidation of glutamine residues in 
synthetic gluten peptides was determined by mass spectrometric analysis as 
described previously (van de Wal, 1998a). Deamidation of a glutamine residue 

20 results in an increase of 1 Da. These conversions were assigned to particular 
glutamine residues by comparison of the fragmentation spectra of tTG treated 
and non-treated peptides. 



Database searching. The program PeptideSearch was used for pattern searches 
25 Mann, M. and Wilm, M., Anal. Chem. 66: 4390-4399. 1994). The patterns were 
composed on the basis of the newly identified specificity of tissue 
transglutaminase in combination with the previously identified HLA-DQ2 
peptide binding motif (Johansen 1996; van de Wal, 1996). For database 
searching a selected subset of wheat proteins from a compilation of the Swiss 
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Prot databank, SPTREMBL and PIR was used. This compilation contained 982 
proteins. 

Tcell clones and T cell lines. Gluten specific T cell lines were generated from 
5 small intestinal biopsies of celiac disease patients as described before (van de 
Wal, 1998a). 



Tcell proliferation assays. Proliferation assays were performed in duplicate in 
150 \d RPMI1640 (Gibco, Breda, NL) supplemented with 10% human serum in 
10 96-well flat-bottomed plates (Falcon) using 10^ T cells stimulated with 10^ 
irradiated PBMCs (3000 RAD) in the presence or absence of antigen (1-10 
pg/ml). After 48 hours at ST^C, cultures were pulsed with 0.5 nCi of ^H- 
thymidine and harvested 18 hours thereafter as described previously (van de 
Wal, 1998b). 



Results 

The specificity oftTG. Previously we have characterized an HIjA-DQ8 
restricted, T cell stimulatory gliadin peptide (van de wal, 1998a;1998b). In the 

20 core of this gliadin peptide (sequence PSGQ208GSFQPSQQ216NPQA) the Q208 
and Q216 are deamidated by tTG, whereas positions Q2i2, Qais, and Q219 are not 
(van de Wal, 1998a). Preliminary studies indicated that the nature of C- 
terminal flanking amino acids influenced deamidation of glutamine residues 
(not shown). We have now investigated this systematically by analyzing the 

25 deamidation in amino acid substitution analogs of a shorter version of the 
gliadin peptide (sequence: PSGQ208GSFQPSQQ216N). In this peptide only the 
Q208 is deamidated by tTG treatment (Fig. 1). An example of this type of 
analysis shows the simidtaneous mass spectrometric analysis of 4 of these 
substitution analogs, that have either the amino acid V, D, E, or F at position 

30 Q208+3, and reveals the distinct masses corresponding to these peptides (Fig. 



wo 03/066079 



PCT/NL03/00077 



27 

la). Upon treatment of the peptides with tTG two of the masses increase by 1 
Da, indicating conversion of a glutamine into a glutamic acid (Fig. la). 
Subsequent analysis of collision induced fragments of the individual peptides 
(not shown) was used to verify that deamidation occurred at Q208 in these 
5 peptides. These results demonstrate that in the substitution analogs with the 
amino acids V and F at position Q208+3, complete deamidation of Q208 is found 
while in the peptides that carry a D or E at Q208+3, no deamidation of Q2oa 
takes place. The complete analysis demonstrates that at only two positions in 
the peptide substitutions had efiect on deamidation of Q208 (not shown and Fig. 
10 lb). First, the replacement of Q209 by a P abolished deamidation of Q208. 

Second, several amino acid substitutions at F211 had a strong negative effect on 
deamidation of Q208« in particular replacements of F2n for amino acids with 
significantly smaller or charged side chains or replacement by proline (Fig. la, 
b). 

15 Next, we carried out a similar analysis to determine the influence of amino 
acid substitutions on the deamidation of Q2ig in a longer version of the peptide 
(Fig.lb). The results demonstrate that the substitution of N217 with a P 
abolishes deamidation of Q21G, a finding that underscores the observation that 
in a QP sequence the Q is not a target for tTG. Moreover, the substitution of 

20 Q219 with a P had a negative effect on deamidation of Q216, confirming the 
negative mfluence of the P in the sequence QXXP (X stands for any amino 
acid). Amino acid substitutions (with the exception of the P substitution) at 
Q216+3, however, had no effect on deamidation of Q216, a result that is in 
contrast to the strong influence of amino acid substitutions at position Q208+3 

25 on deamidation of Q208. A marked difference between the sequence Q208GSF 
and Q216NPQ is the presence of a P in the latter sequence at Q+2 suggesting 
that in the sequence QXP the Q is a good target for deamidation and the 
nature of the amino acid at Q+3 is much less important (see also below). 
Together the results indicate that in the sequences QP and QXXP the Q is not 

30 a target for tTG. In contrast the sequences QXP, QXX(F, Y, W, M. L, I, or V) 
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and QXP(F, Y, W. M, L, I, or V) favor deamidation of the Q (Table 1). To verify 
these "rules" for deamidation by tTG, a large panel of gluten peptides was 
synthesized, treated with tTG and analyzed by mass spectrometry (Table 2). 
Without exception we found that a Q preceding a P is not deamidated by tTG 
5 while in the majority of cases a P at Q+3 also inhibited deamidation. In 
contrast, a bulky/hydrophobic residue at Q+3, and P at Q+2 allowed good 
deamidation. In the case that two opposite rules coincide, as is found in 
peptide 20 (LQQPQQePQFQPQQQF). the negative effect of QPi overrules the 
positive effect of QXXFt at position 6. Together, these results confirm the 
10 defined patterns. Proline thus plays a crucial role in the deamidation of gluten 
peptides. 

Prediction of novel gluten epitopes The peptide binding motifs of the disease 
associated molecules HLA-DQ2 and HLA-DQ8 display. preferential binding of 
negative charges at anchor positions (Johansen, 1996; van de Wal, 1996, 

15 Kwok, 1996). HLA-DQ2 has a strong preference for a negative charge at p4 • 
and p7. Strikingly, at p6 a proline is one of the preferred amino acids in the 
HLA-DQ2 peptide binding motif that could thus target specific deamidation at 
p4 (Table 3). Therefore we combined the HLA-DQ2 peptide binding motif with 
the specificity of tTG to construct an algorithm that could be used to predict 

20 novel T cell stimulatory gluten peptides firom the gluten database (Table 3). In 
this algorithm glutamine residues were introduced at the putative DQ2 
binding positions p4 and p7 that serve as potential target sites for tTG to allow 
introduction of a negative charge at these positions. Next, proline was selected 
for p6 to promote deamidation at Q4, which is also consistent with the 

25 requirements for peptide binding at this position (Table 3). For the p9 position 
two options were designed. In the first option bulky/hydrophobic residues were 
selected for p9, according to the peptide binding motif. In the second option, a 
proline residue was placed at p9, followed by bulky/hydrophobic at plO. This 
option does not agree with the peptide binding motif but should allow optimal 

30 deamidation at the Q? according to oiu: QXPF rule. The remaining positions 
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pi, p2, p3, p5. and p8 were not defined (Table 3). These two algorithms were 
used to search in the gluten database. A total of 98 matches were obtained, 
which corresponded to 27 unique gluten sequences. These 27 unique gluten 
peptides were synthesized, and deamidated by tTG. Mass spectral analysis 
5 confirmed deamidation of the predicted Q4- and Qv-residues (not shown). 

The native and deamidated peptides were pooled in pools of five peptides each, 
and tested in T cell proliferation assays with three gluten specific T cell lines 
isolated firom small intestinal biopsies of celiac disease patients. Two of these T 
cell lines responded to stimulation by several of the deamidated peptide pools 

10 (Fig 2a, results shown for one of the T cell lines). The responses were 

exclusively found against pools that contained peptides predicted by the second 
algorithm, X X X Q4 X Pc Q7 X P9 (Y,F.W,I.L)io (Fig. 2a, b). Subsequently, the T 
cell line was tested against the individual peptides of the positive pools, and 
found to respond to 8 out of 13 peptides tested (Fig, 2b). 

15 In addition we performed searches with a less strict variant of algorithm 2, X 
X X X4 X Pe Q? X P9 (Y,F,W,I,L)io, which predicts the deamidation of oiily one 
Q-residue. This search yielded 261 matches in the gluten database (Table 4). 
Eighteen of these matches corresponded to an alpha-gliadin peptide that has 
previously been shown to be immunodominant in adult celiac disease patients, 

20 QLQPFPQPQLPY (Arentz-Hansen, 2000; Anderson, 2000). 

Next we used the three algorithms to perform additional database searches in 
gluten*like storage molecules, in particular the hordeins firom barley, the 
secalins firom rye and the avenins firom oats. While the first two cereals are 
toxic for CD patients, the latter is not (Janatuinen, E.K. et al, N. Engl. J. Med. 

25 333: 1033-1037, 1995; Janatuinen, E.K et al. Gut 46: 327-331. 2000). 

Strikingly, all three search algorithms yielded many matches in the available 
hordein and secalin sequences but none in the avenin sequences (Table 4). 
CJonsequently, the predicted T cell stimulatory gluten peptide 
QQPFPQQPQQPFPQ (Fig 2) is present in hordeins and secaUns but not in 

30 avenins (Table 4). Additional database searches with less strict algorithms 
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revealed a general lack of matches in the avenin database as compared to the 
gluten, hordein and secalin databases (Table 4). Notably, no matches were 
found in the avenins with three variants of the predictive algorithm 2. 
The enzyme pepsin is known to preferentially cleave proteins after 
5 phenylalanine and to a lesser extent after tjrrosine, leucine, and isoleucine. 
Such a cleavage in the stomach will thus generate many gluten peptides with 
the tTG substrate sequence QXPFQO at the C-terminus. The subsequent 
activity of pepsin and tTG, therefore, favors the generation of many gluten 
peptides with an appropriate p7 anchor for binding to HLA-DQ2. We also 

10 demonstrate that by combining the HLA-DQ2 peptide binding motif with two 
deamidation patterns an algorithm is obtained that predicts novel T cell 
stimulatory peptides in gluten. Strikingly, in this predictive algorithm [XXX 
Q4 X Pe Q7 X P9 (y,F,W,I,L)io] we had incorporated a deamidation pattern that 
results in peptides that do not optimally fit the HLA-DQ2 peptide binding 

15 motif, e.g. they lack an amino acid with a large hydrophobic side chain at 
position 9 in the peptide. The lack of this anchor is probably compensated by 
the introduction of two negative charges in the peptide at p4 and p7 due to 
deamidation that serve as strong anchors. Thus, the T cell stimulatory activity 
of these peptides appears to depend more on optimal deamidation than on 

20 adherence to the exact HLA-DQ2 peptide binding motif. A less stringent 
search algorithm, combining only one deamidation rule with the HLA-DQ2 
peptide binding motif proved equally successful since it predicted the 
previously identified T cell stimulatory alpha-gliadin peptide. Importantly, the 
first predictive algorithm identified 13 peptides, 8 out of which stimulated T 

25 cells. The second, less stringent algorithm predicted 261 peptides. Eighteen of 
those (1 in 15) matched a known T cell stimulatory gliadin peptide. These 
algorithms, therefore, have a high predictive value. It is striking, therefore, 
that these algorithms identified very similar and sometimes identical peptides 
in the barley and rye derived hordeins and secalins but not in the oats derived 

30 avenins. These results appear not related to the size of the avenin database 
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since the secalin database is the smallest of the three investigated, Oats is 
considered safe for CD patients (Janatuinen, 1995; 2000). A striking difference 
between the amino acid content of gliadins, hordeins, and secalins on one side 
and the avenins on the other side is the relative lack of proline in the latter 
5 (Wieser, H. et al Z. Lebensm. Unters. Forsch. 177: 457-460, 1983), While 

gliadins, hordeins and secalins contain approximately 36% glutamine and 20% 
proline, avenins contain a similar percentage of glutamine (34%) but half the 
amount of proline (10%). The lack of proUne residues in Q-rich regions leads to 
non-selective deamidation of the glutamine residues in such regions. In the 
10 gluten sequence PFSQQQQPV. for example, the underlined Q-residues are 
specifically and efficiently deamidated by tTG while in the naturally occurring 
homologous sequence PFSQQQQLVL, in which the proline at p8 is replaced by 
a leucine, all Q-residues are deamidated. The deamidated peptide does not 
match the HLA-DQ2 peptide binding motif. Our results therefore offer a 
15 simple explanation for the clinical observation that oats is not toxic for CD 
patients. Moreover, our results indicate that the replacement of particular 
proline residues in gluten by other amino acids would yield gluten molecules 
with considerably less toxicity for CD patients but that would retain the 
chemical and physical properties that are so desired in gluten. Potentially this 
knowledge may be used to construct gluten molecules with lower toxicity or 
design oligonucleotide primers to identify wheat varieties that lack (some of) 
these relevant proline residues. 

tTG inhibitor /blocker 

A lead compoimd was designed on the basis of an octapeptide, sequence 
QPQLPYPQ, known to be a tTG substrate {Arentz-Hansen. H., et al, 2000} 
and containing the sequence QLPY which allows the transformation of the 
indicated glutamine (Q) into glutamate {Vader, L. W. et al 2002}. Based on the 
knowledge that tTG has a cysteine protease mode of action {Liu, S., R. A. 
Cerione, and J. Clardy. 2002. Structural basis for the guanine nucleotide- 
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binding activity of tissue transglutaminase and its regulation of 
transamidation activity. Proc.NatLAcad.Scl U.S.A 99:2743-2747}, we prepared 
with the aid of standard Fmoc-based solid-phase peptide synthesis, 
biotinylated QPQLPYPQ in which the carboxamide moiety of the glutamine at 

5 position 3 in the substrate is replaced by a vinylogous ester as an electrophilic 
trap (C02Et, Fig. 3). The inhibitor was designed in such a way that 
manipulation and visualization could easily be performed in the studies; a 
biotin group was introduced to enable binding of the inhibitor to 
avidin/strepavidin in order to isolate/purify the inhibitor, the tyrosine is a 

10 possible iodination site to label the peptide with a radio-label and a carboxy- 
terminus enables the site-specific coupling of amine-compounds of choice by 
amide bond formation. It is clear that only the amino acid sequence of the 
inhibitor and the electrophilic trap in the side chain of the amino acid at 
position 3 are important parameters for inhibitory activity and that the other 

15 provided side groups are optional and are not important for the activity of the 
inhibitor/blocker. 

The incorporation of the active site inhibitor in place of the glutamine residue 
was effected by coupling of a glutamic aldehyde species, protected as its 
dimethyl acetal. Upon acidic cleavage from the solid phase, the aldehyde was 
20 liberated and a Wittig reaction carried out on the crude peptide, jdelding the 
target compound after HPLC purification. 

This first-generation inhibitor was found to effectively inhibit the enzyme 
activity. When mixed with the enzyme in an equimolar ratio, the deamidation 
of a substrate peptide was inhibited for over 50%, at a 0.3 molar ratio of 
25 enzyme to inhibitor the enzyme activity was completely abolished, as 

demonstrated by mass spectrometric analysis of the substrate peptide as 
. described {Vader, L. W, et a/., 2002}. (Fig. 4). 
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Table 1. Deamidation patterns in the HLA-DQ8 restricted gliadin peptide. 



Pattern 


Effect on deamidaition 


QPi 


no deamidation 


QXX(F.Y,WXL.I.V)t 


good deamidation 


QXXP4^ 


no deamidation 


QXPt 


good deamidation 


QXP(F,Y.W.M.L.I,V)t 


good deamidation 
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Table 2. Validation of the deamidation patterns. 

The observed deamidation patterns in a set of twenty-one gluten peptides were 
compared with those predicted according to the deamidation patterns given in Table 
1. The deamidation patterns are represented in the four columns. The percentages 
5 shown indicate whether the pattern is fully or partially applicable to the specific 

deamidation of Q residues in the listed gluten peptides. A bar (-) indicates the absence 
of that sequence in the peptide. Deamidated glutamine residues are shown in bold 
and underlined. 



Peptides 


QPi 


Rules 
QXXF QXXP 
T i 


QXPt 


1 


VPVPQLQPQNPSQ.QQPQEQVPL 


100% 


100% 


100% 


66% 


2 


PLVQQQQFLGQQQPFPPQ 


100% 


100% 


100% 


100% 


3 


YYPTSPQQSGQG 










4 


QGQQGYYPTSPQ 




50% 




II^BHii 


5 


SSQVSFQPSQLN 


100% 


100% 






6 


FPQTQQPQQLFPQSQ 


100% 




100% 


100% 


7 


GQQGYYPTSVQQSGQ 




50% 






8 


GSVQPQQQLPQFEm 


100% 


100% 


100% 


100% 


9 


FPQHCNYQQQPQTFPQPF 


100% 




100% 


100% 


10 


QQPIQPQQFPQQQFF 


100% 


50% 


100% 


50% 


11 


QSGQYQQQPQMQTT 


100% 




100% 


100% 


12 


ESQQSQQDEPQPF 


100% 




100% 




13 


FPQPEDQQSQQSE 


100% 






14 


LQQVQQGPQQQPFPQPQPF 


100% 


100% 


66% 


100% 


15 


QVQWPQQQPFPQPQQPF 


100% 


50% 


100% 


33% 


16 


PLLPQQPFPSQQEQPQF 


100% 


50%-^ 


100% 


100% 


17 


WQQQPPFSQEQPIL 


100% 




33% 


100% 


18 


QSNLPQPAQQPFQPQVPQOP 


100% 


100% 




66% 


19 


WFQPSQLNPQAQQDQPQ 


100% 


immii 


100% 




20 


LQQPQQPQFQPQQQF 


100% 


50%-^ 


100% 


50% 


21 


VQQQIPWQPSIL 


100% 


33%'^ 


100% 


100% 



^QP inhibition of deamidation overrules the positive influence of QXXF 



BEST AVAIUBLE COPY 
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Table 3. Design of search algorithms based on the HLA-DQ2 peptide binding motif 
and the tTG specificity. 



D02 oeDtide binding motif — 


Residue 


.2 .1 1 2 3 4 5 6 7 8 9 10 11 12 


Preferre 
d 


F,W. D.E p. jj F,W,Y 
V I ^ ^ M 


DQ2 epito 


pe selection aigorithnas . 


Algorith 
ml 


XX X X X Q X p Q X ^-y^w X XX 


Algorith 
m2 


XXXXXQXPQXP^^XX 


Gluten D 


ntnliAQA search — 


Algorith 
m 1 


Matches: 52 
Unique: 14 


Source: Gliadin (6); Glutenin (8) 


Algorith 
m 2 


Matches: 46 
Unique: 13 


Source: Gliadin (aU) 
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Table 4. Database searches with the search algorithms. 

The search algorithms reveal many matches in the gluten, hordeins and secalin 
sequences but very few in the avenin sequences. The predictive algorithm 2 
[QXPQXP(YFWIL)1 and the less strict variants of that algorithm have no matches in 
the avenin database. The novel T cells stimulatory gluten peptide 
(QQPFPQQFQQPFPQ) is also found in the hordeins and secalin but not in the 
avenins. Other search algorithms identify only a limited number of sequences in the 
avenins. 



Search algorithm 


# of matches in databases 






gluten 


hordein 


secalin 


avenin 


Algorithm 1 


Q4XPQ7X(YFWIL) 


52 


48 


8 




Algorithm 1 - Q4 


XXPQtXCYFWIL) 


334 


91 


14 


6 


Algorithm 1 - Q7 


Q4XPXX(YFWIL) 


286 


142 


60 


11 


Algorithm 2 


Q4XPQ7XP(YFWIL) 


46 


60 


33 




Algorithm 2 - Q4* 


XXPQtXPCYFWIL) 


261 


196 


89 




Algorithm 2 - Q? 


Q4XPXXP(YFWIL) 


51 


68 


33 




Short algorithm 


QXP(YFWIL) X* P** 


>500 


276 


105 


7 


Predicted epitope 


QQPFPQQFQQPFPQ 


6 


12 


2 





10 



a9 epitope 

** A proline at p2 in this algorithm inhibits deamidation of the glutamine 



\ 

« 
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Table 5 

List of protein autoantigens in multiple sclerosis (MS), rheumatoid arthritis (RA), 
diabetes (DB), systemic lupus erythematosus (SLE), Sjogren syndrome (SS) and 
Coeliac disease (CD) and peptides derived from these proteins that contain the tTC- 
5 substrate motif disclosed herein' 





Autoantigen^ 


Code 
used' 




Accession 
number 


Amino 
acids^ 


10 


MS 












Myelin basic protein 


MBP 


Swiss Prot 


P02686 


304 


15 


Myel in-oligodendrocyte 
glycoprotein 


MOG 


Swiss Prot 


Q16653 


247 




Myelin proteoliptd protein 


PLP 


Swiss Prot 


P06905 


276 


20 


RA 

Complement Clq 
subcomponent, A chain 


CIQA 


Swiss Prot 


P02745 


245 


25 


Complement Clq 
subcomponent, B chain 


CIQB 


Swiss Prot 


P02746 


251 




Complement Clq 
subcomponent, C chain 


CIQC 


Swiss Prot 


P02747 


245 


30 


Carbonic anhydrase II 


CA2 


Swiss Prot 


P009I8 


259 




Calpain inhibitor 


CALP 


Swiss Prot 


P208I0 


708 


35 


Alpha-] type II collagen 


C0L2A1 


TrEMBL 


Q14047 


1487 


Filaggrin 


FILA 


Swiss Prot 


P20930 


416 


40 


78 kDa glucose* 
regulated protein 


GR78 


Swiss Prot 


PI 1021 


654 


Matrix metalloprotease-l 


MMPl 


Swiss Prot 


P03956 


469 




Matrix metal loprotease-3 


MMP3 


Swiss Prot 


P08254 


477 


45 


Matrix metaIloprotease-19 


MMPI9 


Swiss Prot 


Q99542 


508 




Heterogenous nuclear 
ribonucleoproteins A2/B 1 


R0A2 


Swiss Prot 


P22626 


353 
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Pulmonary surfactant- 
associated protein A 


SPA 


Swiss Prot 


P07714 


248 


5 


Vinculin 


VINO 


Swiss Prot 


PI 8206 


106S 




Chitinase 3-like protein 2 


YKU9 


Swiss Prot 


QIS782 


390 




Chitinase 3-like protein I 


YKL40 


Swiss Prot 


P36222 


383 


10 














Carboxypeptidase H 


CPH 


Swiss Prot 


P16870 


476 


15 


Deha-like protein 


DLK 


Swiss Prot 


P80370 


383 


Glutamate decarboxylase, 
65 kDa isoform 


GAD 


Swiss Prot 


Q05329 


585 


20 


Glucose transporter type 4 


GLUT4 


Swiss Prot 


P14672 


509 


60 kDa Heat shock protein 


HSP60 


Swiss Prot 


PI08a9 


573 


25 


Protein-tyrosine 
phosphatase-like N 


JCA 


Swiss Prot 


Q16849 


979 


69 kDa islet cell 
autoantigen 


ICA69 


Swbs Prot 


Q05084 


480 


30 


Imogen 44 


IM44 


TrEMBL 


Q6I733 


384 


Insulin precursor 


INS 


Swiss Prot 


P01308 


no 




Insulin receptor 


INSR 


Swiss Prot 


P06213 


1382 


35 


Insulin receptor substrate- 1 


IRSI 


Swiss Prot 


P35568 


1242 




SOX- 13 protein 


SOX 13 


Swiss Prot 


Q9UN79 


889 


40 


SLE/SS 

Centromere protein A 


CENPA 


Swiss Prot 


P49450 


140 




Centromere protein B 


CENPB 


Swiss Prot 


P07199 


599 


45 


DNA excision repair 
protein ERCC-1 


ERCCl 


Swiss Prot 


P07992 


297 




60S ribosomal protein L7 


RL7 


Swiss Prot 


P18i24 


248 


50 


52 kDa ribonucleoprotein 
autoantigen RO/SS-A 


ROSS 


Swiss Prot 


P27797 


417 


55 


Small nuclear ribonucleo- 
protein associated 
proteins B and B* 


SMPB 


Swiss Prot 


PI4678 


240 
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Small nuclear ribonucleo- 
pr otein associated 



protein N 


SMPN 


Swiss Prot 


P 14648 


240 


5 SNRPB protein 


SNRPB 


TrEMBL 


QI5182 


285 


SSA protein SS-56 


SS56 


TrEMBL 


Q96PF7 


485 



Miscellaneous/CD 

Tiisue transglutaminase TTG Swiss Prot P2I980 687 



This table reflects the consensus amino acid sequences of the proteins mentioned. It is 
knovvn that for some proteins mentioned variants exsist, like splice variants and variants 
15 having one or more amino acid mutations. These variants should be regarded as being 
comprised in the table. Also the peptides that are derived from these variant and that 
contain the tTG-substrate motif disclosed herein should be regarded as being comprised in 
this table. 

^ Some of the proteins listed are also known by one or more other names 
20 ^ For some of the proteins listed one or more other codes are used in databases 

Number of amino acids given here reflect the amino acid sequence indicated, numbers 
might be different for variants 

Some antigens are known to possibly play a role in more then one of the diseases 
mentioned. The way the antigens are grouped is only meant for clarity and is by no means 
25 intended to limit the claims 

Peptides are named by the sequence number in the protein that corresponds to the amino- 
terminal amino acid of the peptide 

Only 1 9-mer peptides having the target Q at position 10 are shown. It is knowoi that the 
length of HLA-bound peptides can vary from 8 to up to 40 amino acids. Longer peptides 
30 containing a 1 9-mer sequence or part of a 1 9-mer sequence from the table and shorter 
peptides containing part of a 1 9-mer sequence from the table are considered to be 
comprised in this table 
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Multiple sclerosis (MS) 

5 MBP 



246 SRFSWGA£GQRPGrGYGGR 
272 HKGFKGVDAQGTLSKIFKL 



10 

MOG 



15 



22 


L 


Q 


V 


S 


S 


S 


Y 


A 


G 


Q 


F 


R 


V 


I 


6 


P 


R 


H 


P 


81 


R 


N 


G 


K 


D 


Q 


D 


G 


D 


Q 


A 


P 


E 


Y 


R 


G 


R 


T 


E 


159 


L 


A 


V 


L 


P 


V 


L 


L 


L 


Q 


I 


T 


V 


G 


L 


V 


F 


L 


C 


170 


T 


V 


G 


L 


V 


F 


L 


C 


L 


Q 


Y 


R 


L 


R 


G 


K 


L 


R 


A 



20 



PLP 



59 YLINVIHAFQYVIYGTASF 
224 LSICKTAEFQMTFHLFIAA 



Rheumatoid arthritis (RA) 

25 

CIQA 

159 YYFTFQVLSQWEICLSIVS 
30 20S MVLQLQQGDQVWVEKDPKK 



CIQB 

35 15 LLLGLIDISQAQLSCTGPP 

181 NLMRGRERAQKVVTFCDYA 



CIQC 

40 

110 EEGRYKQKFQSVFTVTRQT 



CA2 

45 



50 



55 



18 


D 


F 


P 


I 


A 


K 


G 


E 


R 


Q 


S 


P 


V 


D 


I 


D 


T 


H 


T 


82 


P 


L 


D 


G 


T 


Y 


R 


L 


I 


Q 


F 


H 


F 


H 


W 


G 


S 


L 


D 


125 


K 


Y 


G 


D 


F 


G 


K 


A 


V 


Q 


Q 


P 


D 


G 


L 


A 


V 


L 


G 


147 


K 


V 


G 


S 


A 


K 


P 


G 


L 


Q 


K 


V 


V 


D 


V 


L 


D 


S 


I 



CALP 



81 


K 


K 


A 


V 


S 


R 


S 


A 


E 


Q 


Q 


P 


S 


E 


K 


S 


T 


E 


P 


95 


K 


S 


T 


£ 


P 


K 


T 


K 


P 


Q 


D 


M 


I 


S 


A 


G 


G 


E 


S 


536 


A 


A 


A 


I 


S 


B 


V 


V 


S 


Q 


T 


P 


A 


S 


T 


T 


Q 


A 


G 


639 


E 


D 


S 


K 


K 


P 


A 


D 


D 


Q 


D 


P 


X 


D 


A 


L 


S 


G 


D 
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C0L2A1 



116 


V 




It 




o 


n 

r 


n 
f 


is 






u 


p 

It 


A 


o 


E Q G 


P 


R 




p 




P 


Ci 




r 


A 




L 


Q 


b 




n 
l\ 




D R G 


D 


K 




f* 


c 


n 
u 


b 


K 


A 


G 


G 


A 


Q 


L 




ir 
V 


M 

M 


Q G P 


M 


G 


191 




ri 


(5 


A 


y 


T 
Ij 




w 

V 


M 






p 


M 


Vj 


P M G 


P 


R 


380 


T 


G 


A 


R 


G 


P 


e 


G 


A 


Q 


G 


P 


R 


G 


E P G 


T 


P 


458 


P 


G 


I 


A 


G 


F 


K 


G 


E 


Q 


G 


P 


K 


G 


E P G 


P 


A 


511 


A 


P 


G 


N 


R 


G 


F 


P 


G 


Q 


D 


G 


L 


A 


G P K 


G 


A 


563 


T 


G 


R 


P 


G 


D 


A 


G 


P 


Q 


G 


K 


V 


G 


P S G 


A 


P 


707 


R 


G 


S 


P 


G 


A 


Q 


G 


L 


0 


G 


P 


R 


G 


L P G 


T 


P 


857 


K 


G 


D 


A 


G 


A 


P 


G 


P 


Q 


G 


P 


S 


G 


A P 6 P Q 


866 


Q 


G 


P 


S 


G 


A 


P 


G 


P 


Q 


G 


P 


T 


G 


V T G 


P 


K 


938 


P 


G 


R 


A 


G 


E 


P 


G 


L 


Q 


G 


P 


A 


G 


PPG 


E 


K 


970 


P 


P 


G 


P 


Q 


G 


L 


A 


G 


Q 


R 


G 


I 


V 


G L P 


G 


Q 


1085 


Q 


G 


D 


R 


G 


E 


A 


G 


A 


Q 


G 


P 


M 


G 


P S G 


P 


A 


1100 


S 


G 


P 


A 


G 


A 


R 


G 


I 


Q 


G 


P 


Q 


G 


P R G 


D 


K 


1103 


A 


G 


A 


R 


G 


I 


Q 


G 


P 


Q 


G 


P 


R 


G 


D K G 


E 


A 


1365 


N 


L 


A 


P 


N 


T 


A 


N 


V 


Q 


M 


T 


F 


L 


R L L 


S 


T 



20 



FILA 



62 


S 


R 


H 


S 


T 


S 


Q 


C 


G 


Q 


D 


T 


I 


H 


G H 


R 


G 


S 


199 


A 


S 


R 




H 


H 


G 


s 


A 


Q 


E 


Q 


L 


R 


D G 


S 


R 


H 


386 


S 


R 


H 


S 


A 


S 


Q 


D 


G 


Q 


D 


T 


I 


R 


G H 


P 


G 


S 



GR78 

30 



100 


G 


R 


T 


W 


N 


D 


P 


S 


V 


Q 


Q 


0 


I 


K 


F 


L 


P F 


K 


120 


V 


£ 


K 


K 


T 


K 


P 


Y 


I 


Q 


V 


D 


I 


G 


G 


G 


Q T 


K 


251 


T 


H 


L 


G 


G 


£ 


0 


F 


D 


Q 


R 


V 


M 


£ 


H 


F 


I K 


L 


295 


A 


K 


R 


A 


L 


S 


s 


Q 


H 


Q 
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IRSl 
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Miscellaneous/Celiac disease 
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Claims 

1. Peptide sequence of 14-40 amino acids, which is prone to deamidation by 
tTG and which is a causative factor of celiac disease, which comprises an 
amino acid sequence selected from the group consisting essentially of 
QQPYPQQPQQPFPQ, QQPFPQQPQQPFPQ, PFPQTQQPQQPFPQ, 

5 PFPQLQQPQQPFPQ, QFPQTQQPQQPFPQ. QLPFPQQPQQPFPQ. 
PFPQPQQPQQPFPQ and PFPQSQQPQQPFPQ. 

2. Epitope which is the causative factor of celiac disease, selected from the 
group consisting essentiaUy of QQPYPEQPQQPFPQ, QQPFPEQPQQPFPQ, 
PFPQTEQPQQPFPQ. PFPQLEQPQQPFPQ, QFPQTEQPQQPFPQ, 

10 QLPFPEQPQQPFPQ. PFPQPEQPQQPFPQ. PFPQSEQPQQPFPQ, 
QQPYPEQPEQPFPQ. QQPFPEQPEQPFPQ, PFPQTEQPEQPFPQ, 
PFPQLEQPEQPFPQ, QFPQTEQPEQPFPQ. QLPFPEQPEQPFPQ. 
PFPQPEQPEQPFPQ, PFPQSEQPEQPFPQ, QQPYPQQPEQPFPQ, 
QQPFPQQPEQPFPQ, PFPQTQQPEQPFPQ, PFPQLQQPEQPFPQ, 

15 QFPQTQQPEQPFPQ, QLPFPQQPEQPFPQ, PFPQPQQPEQPFPQ and 

PFPQSQQPEQPFPQ, wherein further Q residues may be deamidated into an E 
residue. 

3. Peptide sequence, which is prone to deamidation by tTG and which is a 
causative factor of an autoimmune disease selected from the group consisting 

20 essentially of RA (rheumatoid arthritis). MS (multiple sclerosis), SLE (systemic lupus 
erythematosus), SS (Sjogren syndrome) and DB (diabetes), selected from the group 
consisting essentially of: 

SRFSWGAEGQRPGFGYGGR, 

HKGFKGVDAQGTLSKI FKL, 
25 LQVSSSYAGQFRVIGPRHP, 

RNGKDQDGDQAPEYRGRTE, 

LAVLPVLLLQITVGLVFLC, 

TVGLVFLCLQYRLRGKLRA, 

YLINVIHAFQYVIYGTASF, 
30 LSICKTAEFQMTFHLFIAA, 

YYFTFQVLSQWEICLSIVS, 

MVLQLQQGDQVWVEKDPKK, 
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GEDTSMNLVQKYLENYYDL, 

PVVKKIREMQKFLGLEVTG, 
TDLTRFRLSQDDINGIQSL, 
LSQDDINGIQSLYGPPPDS, 
5 FFYFFTGSSQLEFDPNAKK, 
EVAPVDYLSQ Y G Y L Q K P L E, 
DYLSQYGYLQKPLEGSNNF, 
RCGLEDPFNQKTLKYLLLG, 
DIRLSFHGRQSSYCSNTFD, 

10 LGLGHSRYSQALMAPVYEG, 
LHPDDVAGIQALYGKKS PV, 
DAALYWPLNQKVFLFKGSG, 
GKVYWRLNQQLRVEKGYPR, 
NYNDFGNYNQQPSNYGPMK, 

15 LPAHLDEELQATLHDFRHQ, 
LQTRGALSLQGSIMTVGEK, 
KMAKMIDERQQELTHQEHR< 
SIAKKIDAAQNWLADPNGG. 
HLEGKIEQAQRWIDNPTVD, 

20 PTVDDRGVGQAAIRGLVAE, 
ANVMMGPYRQDLLAKCDRV, 
QARALASQLQDSLKDLKAR, 
RILLRNPGNQAAYEHFETM, 
KVAMANIQPQMLVAGATSI. 

25 AGNISDPGLQKSFLDSGYR, 
A K V REAFQPQEPDFPPPPP, 
F P P PPPDLEQLRLTDELAP, 
CERIPTISTQLKILSTVKA, 
ATEMLVHNAQNLMQSVKET, 

30 YFTNWSQDRQEPGKFTPEN, 
LIHELAEAFQKDFTKSTKE, 
AKITRLQDQQVPYAVKGNQ, 
DDFTGKSCNQGPYPLVQAV. 
IKEAQPGKKQLLLSAALSA, 

35 ATVHRTLGQQVPYATKGNQ. 
GNQWVGYDDQESVKSKVQY. 
VWALDLDDFQGSFCGQOLR, 
CGWLLGAEAQEPGAPAAGM. 
AECFPACNPQNGFCEDDNV, 

40 NVCRCQPGWQGPLCDQCVT 
NGGTCLQHTQVSYECLCKP 
TPGVHELPVQQPEHRILKV, 
ARKAACACDQKPCSCSKVD 
GERPTLAFLQDVMNILLQY 

45 LQDVMNILLQYVVKSFDRS 
NLEEILMHCQTTLKYAIKT 
LERRILEAKQKGFVPFLVS 
IGSEDGEPPQQRVTGTLVL 
VFSAVLGSLQFGYNIGVIN 

50 YNIGVINAPQKVIEQSYNE 
LRGALGTLNQLAIVIGILI 
GLTVLPALLQLVLLPFCPE 
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WFIVAELFSQGPRPAAMAV, 
TSKGQKCEFQDAYVLLSEK, 
LSEKKISSIQSIVPALEIA, 
DKAQIEKRIQEIIEQLDVT, 
5 EKRIQEIIEQLDVTTSEYE, 
LCSHLEVCIQDGLFGQCQV, 
VCIQDGLFGQCQVGVGQAR, 
IQDGLFGQCQVGVGQARPL, 
VTSPVLQRLQGVLRQLMSQ, 
10 GLSWHDDLTQYVISQEMER, 
I PTGSAPAAQHRLPQPPVG, 
GASSSLSPLQAELLPPLLE, 
EEYGYIVTDQKPLSLAAGV, 
NLSLADVTQQAGLVKSELE, 
15 GLVKSELEAQTGLQILQTG, 
EAQTGLQILQTGVGQREEA, 
TPSWCEEPAQANMDISTGH, 
PRMPAYIATQGPLSHTIAD, 
LSHTIADFWQMVWESGCTV, 
20 QTQETRTLTQFHFLSWPAE, 
AATLEHVRDQRPGLVRSKD, 
RPGLVRSKDQFEFALTAVA, 
HKCSYPWOLQDRYAQDKSV, 
QDKSVVNKMQQRYWETKQA, 
25 DKSVVNKMQQRYWETKQAF, 
MQQRYWETKQAFIKATGKK, 
DLSKAIVLYQKRICFLSQE, 
TGKALCFSSQQRLALRNPL, 
KQMEKFRKVQTQVRLAKKN, 
30 MEKFRKVQTQVRLAKKNFD, 
IiLSHMLATYQTTLLHFWEK, 
YEFTTLKSLQDPMKKLVEK, 
QQESTDAAVQEPSQLISLE, 
LLDQNMKDLQASLQEPAKA, 
35 NMKDLQASLQEPAKAASDL, 
GAVRTENNIQRHFCTSRSI, 
PSGRASTRPQHQIQFDEOM, 
GRASTRPQHQIQFDEDMDS, 
DKYLEDFPKQGPIRLFMEL, 
40 REAEDLQVGQVELGGGPGA, 
NCSVIEGHLQILLMFKTRP, 
KNSRRQGCHQYVIHNNKCI, 
WSKHNLTTTQGKLFFHYNP, 
LMRGLKPWTQYAIFVKTLV, 
45 PISVSNSSSQIILKWKPPS, 
ESEDSQKHNQSEYEDSAGE, 
ENNVVHLMWQEPKEPNGLI, 
KITLLRELGQGSFGMVYEG, 
NPGRPPPTLQEMIQMAAEI, 
50 AIAADSEAEQDSWYQALLQ, 
SEAEQDSWYQALLQLHNRA, 
PGPAFKEVWQVILKPKGLG, 
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HHLNNPPPSQVGLTRRSRT, 
TITHQKTPSQSSVASIEEY, 
MSPKSVSAPQQIINPIRRH, 
SLPRSFKHTQRPGEPEEGA, 
5 PRAREQQQQQQPLLHPPEP, 
yVNIEFGSDQSGYLSGPVA, 
MTMQMSCPRQSYVDTSPAA, 
AHSSLLGGPQGPGGMSAFT, 
DLDLVKDFKQCPQECTPEP, 

10 RASQDSADPQAPAQGNFRG, 
DSADPQAPAQGNFRGSWDC, 
GVS.EAASGSQEKLDFNRNL, 
MEAKDVKGTQESLAEKELQ, 
QESLAEKELQLLVMIHQLS, 

15 DQLLTAHSEQKNMAAMLFE, 
MAAMLFEKQQQQMELARQQ, 
AMLFEKQQQQMELARQQQE, 
QQMELARQQQEQIAKQQQQ, 
QQQEQIAKQQQQLIQQQHK, 

20 QQEQIAKQQQQLIQQQHKI, 
KQQQQLIQQQHKINLLQQQ, 
QQQHKINLLQQQIQQVNMP, 
INLLQQQIQQVNMPYVMIP, 
QPLPVTPDSQLALPIQPIP, 

25 GAMATHHPLQEPSQPLNLT, 
QEKQPYYEEQARLSRQHLE, 
MRTRRQDARQSYVI PPQAG, 
SYVI PPQAGQVQMSSSDVL, 
STSAFRAYGQGTLYDSPLL, 

30 GTLYDSPLLQVSIHLGYGI, 
ITRIALYFVQKGLAVPCCF, 
LSLSVLVSLQGPLFLSYLG, 
SSHQHSRRRQGWLKEIRKL, 
RGVDFNWQAQALLALQEAA, 

35 PSVAEGYASQDVFSATETS, 
GKSAKPRAGQAGLPCDYTA, 
LDTSGLRHVQLAFFPPGTV, 
VQQVKGHYRQAMLLKAMAA, 
LKAMAALEGQDPSGLQLGL, 

40 LEGQDPSGLQLGLTEALHF, 
EGLEAEDWAQGVVEAGGSF, 
GAYGAQEEAQCPTLHFLEG, 
TSFPIDDRVQSHILHLEHD, 
HVTRKNHARQAGVRGLGHQ, 

45 NFALRVLLVQVDVKDPQQA, 
LVQVDVKDPQQALKELAKM, 
YLETYKAYEQKPADLLMEK, 
ADLLMEKLEQDFVSRVTEC, 
VKSVNKTDSQTLLTTFGSL, 

50 IKRLRKKFAQKMLRKARRK, 
SFEPFSNKGQTLVVQFTVK, 
SNKGQTLVVQFTVKHEQNI, 
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and 



4. Peptide sequence of 8 - 40 amino acid residues which comprises at least 
one 8-mer which is a subsequence of at least one of the sequences listed in 
claim 3, wherein said 8-mer comprises the central Q residue of the sequence 

30 listed in claim 3. 

5. Epitope formed by deamidation of one or more of the Q residues of the 
peptide according to claim 3 or 4. 

6. An isolated HLA-DQ restricted T*ceU capable of recognising an epitope 
according to claim 2. 

35 7. An isolated or recombinant HLA-DQ restricted T-oell receptor capable of 
recognising an epitope according to claim 2. 

8. Antibody reactive with a peptide or an epitope according to claim 1 or 2. 

9. Method to screen foodstuffs or plant material for the occurrence of 
peptides according to claim 1 or epitopes according to claim 2 using a T cell 

40 according to claim 6» a T cell receptor according to claim 7 or an antibody 
according to claim 8. 
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10. Method according to claim 9, wherein the foodstuff or plant material is 
pretreated with a protease. 

11. Method according to claim 10, wherein the protease is trypsin, 
chymotrypsin or pepsin. 

12. Method according to any of claims 10 or 11, wherein the foodstuff or 
plant material is additionally treated with tTG. 

13. A diagnostic kit comprising: 

* a T cell according to claim 6 or a T cell receptor according to claim 7 or an 
antibody according to claim 8; and 
- a suitable means of detection. 

14. A diagnostic kit according to claim 13 for detecting in food, food 
components, plant material or biological samples the presence of a peptide or 
epitope according to claim 1 or 2. 

15. An oligonucleotide primer set suitable for use in an amplification assay 
which is capable of binding to a nucleotide sequence coding for a peptide 
according to claim 1, an epitope according to claim 2 or a DNA sequence which 
codes for a peptide which matches a consensus sequence according to the 
foUowing formula (1), (2), (3). (4) or (5): 

(1) X1.X2.X3-Q4-X5-P6-Q7-X8-P9-X10 

in which Xio is F, Y, M, W, I or L and Xi, X2» X3, Xs, and Xs are any amino acid; 

(2) Xl-X2-X3'Q4-X5-P6.Q7.X8-X9.Xl0 

in which X9 is F, Y, M. W, I or L and Xi, X2, Xs, Xs, Xs and Xio are any amino acid; 

(3) Q - Xi - P in which Xi is any amino acid; 

(4) Q - Xi - P - X2 in which Xi is any amino acid and X2 is F,Y,W,M,L,I, or V; 

(5) Q-Xi*-X2-X3 in which Xi and X2 are any amino acid and Xs is 
F.Y.W,M.L.I,orV. 

16. A nucleotide amplification assay wherein food, food components, plant 
material or biological samples are tested for the presence of proteins or 
peptides which match any of the consensus sequences: (1) or (2) as defined in 
claim 15. 



wo 03/066079 



PCT/NL03/00077 



55 

17. Method for the detection of proteins or peptides which match any of the 
consensus sequences: (1) or (2) as de£jied in claim 15 in a nucleotide 
amplification assay in which the primer set of claim 15 is used. 

18. Method for the detection of cereals which are better suited to be used in 
5 food for celiac disease patients by using a method according to claim 9, 10, 11, 

12, 16 or 17. 

19. Method for producing cereals which are better suited to be used in food 
for celiac disease patients by selecting cereals having a low amount of proteins 
or peptides matching the consensus sequences as defined in claim 15 and 

10 crossbreeding them to produce cereals with even lower amounts. 

20. Method for producing cereals which are better suited to be used in food 
for ceUac disease patients by using genetic engineering techniques to diminish 
the natural amount of proteins or peptides matching the consensus sequences 
as defined in claim 15. 

15 21. Method according to claim 20, wherein the genetic engineering 
comprises transformation with recombinant DNA or induced mutation. 

22. Method according to claim 21, wherein the transformation with 
recombinant DNA comprises homologous recombination or gene silencing. 

23. Method according to any of claims 19-22 wherein the cereal is wheat. 
20 24. Cereal plant obtained by any of the methods 19*23. 

25. Use of a cereal according to claim 24 to prepare food for celiac disease 
patients. 

26. A method to identify neo-epitopes firom auto-antigens comprising: 
- comparing the sequence of an auto-antigen and searching for peptide 

25 sequences which match with the consensus sequences as defined in claim 15; 
• screen biological material obtained firom auto-immune disease patients for 
antibodies which bind to the peptide sequences or T cells which bind to the 
HLA-bound peptide sequences as identified in the previous step or peptides 
which are formed firom said peptide sequences by deamidation of one or more 

30 Q residues. 
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27. A method to inhibit the binding of a T cell receptor to an HLA-bound 
epitope according to claim 2 or to an HLA-bound epitope according to daim 5 
comprising providing a blocking substance. 

28. A method to inhibit the binding of the epitopes of claim 2 to HLA-DQ 
5 molecules comprising providing substances that block the binding of said 

epitopes to said HLA-DQ molecules. 

29. A method to inhibit the binding of the epitopes of claim 5 to HLA 
molecules comprising providing substances that block the binding of said 
epitopes to said HLA molecules. 

10 30. Analogue of an epitope according to claim 2 or claim 5, characterised in 
that said analogue is an antagonist for the activity of T cells recognising said 
epitope. 

31. A pharmaceutical composition comprising a peptide according to claims 
1 or 4 or an epitope according to claims 2 or 5. 
15 32. A pharmaceutical composition according to claim 31 for the induction of 
tolerance. 

33. A pharmaceutical composition according to claim 31 for the treament of 
celiac disease or auto-immune disease. 

34. A method to decrease sensitivity to auto-antigens by inhibiting the 
20 deamidation of said auto-antigen. 

35. Method according to daim 34 wherein the inhibition is by inhibiting the 
function of tTG. 

36. Method according to claim 35, wherein the inhibition takes place at the 
site of presence of the auto-antigen. 

25 37. Use of a protease inhibitor for preventing the generation of a peptide 

according to claim 1 or claim 4 or a polypeptide comprising a peptide according 
to claim 1 or claim 4. 

38. Use of acid neutraUsing substances for preventing the generation of a 
peptide according to claim 1 or claim 4 or a polypeptide comprising a peptide 
30 according to claim 1 or daim 4. 
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39. Method to reduce the production of a peptide according to claim 3, 4 or 5 
by inhibiting the expression of the auto-antigen from which it is derived. 

40. A tTG inhibitor/blocker characterised in that said inhibitorA)locker is 
derived from a peptide sequence of claim 1 or 4 and further comprises an 

5 electrophilic trap. 

41. A tTG inhibitorA)locker characterised in that said inhibitorA)locker is 
derived from the peptide QPQLPYPQ and further comprises an electrophilic 
trap. 

42. A pharmaceutical composition comprising a tTG inhibitor/blocker 
10 according to claim 40 or 41 or a functional equivalent and/or functional 

fragment thereof. 

43. Use of a tTG inhibitor/blocker according to claim 40 or 41 or a 
functional equivalent and/or functional fragment thereof for the preparation of 
a medicament for the treatment of celiac disease or auto-immune disease. 
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